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1  Introduction 

STEPparse,  the  NIST  STEP  physical  tile  parser,  and  the  asvxiated  STEP  Worksn^c 
Form,  are  Public  Domain  iool.s  for  manipulating  prcxluci  mtxicls  stored  in  the  STEP 
physical  file  format  { AltemuellerHSi.  These  tools  arc  a  pan  of  ihc  NIST  PDFS  Tisi.iik.: 
[C!ark9(3al,  and  are  geared  panicularly  toward  building  STEP  translators  The  S  TE  P 
Working  Form  is  an  in-memory  representation  for  STEP  product  mexScis  It  rches  on 
the  NTST  Express  Working  Form  [CIark90b!  as  an  m-core  data  dictionan .  which  pro 
vides  a  context  in  which  STEP  mcxlels  can  be  interpreted  The  Working  Form  ccxic  and 
the  STEPparse  parser  itself  arc  both  wnticn  to  be  independent  of  any  ponicuiar  S4.  hem;i 
simply  plug  in  some  Express  language  information  model  ISchcnck9<)!,  and  the  ccxle  ss 
ready  to  run. 

A  primary  goal  in  the  development  of  STEPparse  w  as  to  provide  a  clean  back  end  in¬ 
terface  which  would  allow  various  output  modules  to  be  easily  plugged  into  the  basic 
front-end  parser.  To  accomplish  this,  the  parser  builds  up  a  set  of  data  structures  i;he 
STEP  Working  Form)  containing  ail  of  the  information  in  a  STEP  source  ftic  It  can 
then  dynamically  load  one  or  more  output  modules.  Each  module  walks  through  the 
W'orking  Form,  extracting  relevant  subsets  of  the  available  data  and  producing  an  ap¬ 
propriately  formatted  output  file.  Three  STEPparse  output  modules  are  provided  with 
the  MST  PDES  Toolkit:  one  which  produces  SmaiUalk-80^  objc>-i  instantiations,  one 
which  produces  a  STEP  physical  fUc  (so  the  the  Working  Form  can  be  used  to  translate 
to  as  well  as  from  STEP),  and  one  which  toads  an  SQL  database  from  the  STEP  Work 
ing  Form  {Nicker5on90].  The  former  is  used  by  QDES  [ClarkdOdj.  a  prototype  STEP 
model  editor  written  in  Smalltalk-80. 

1.1  Context 

The  PDES  (Product  Data  Exchange  using  STEP)  activity’  is  the  fnited  States'  effon  tn 
support  of  the  Standard  for  the  Exchange  of  Fh-oduct  Model  Data  (STEP),  an  emerging 
international  standard  for  the  interchange  of  product  data  between  vanous  vendors' 
CAD/CAM  systems  and  other  manufactunng-relatcd  software  [Smith881,  A  National 
PDES  Testbed  has  been  established  at  the  National  Institute  of  Standards  and  Techno! 
ogy  to  provide  testing  and  validation  facilities  for  the  emerging  standard.  The  Testbed 
is  funded  by  the  CALS  (Computer-aided  Acquisition  and  Logistic  Suppon)  program  of 
the  Office  of  the  Secretary  of  Defense.  As  pan  of  the  testing  effort.  NIST  is  charged 
with  providing  a  software  toolkit  for  manipulating  PDES  data.  This  NIST  PDES  Tool¬ 
kit  is  an  evolving,  research-oriented  set  of  software  tools.  This  document  is  one  of  a  set 
of  reports  which  describe  various  aspects  of  the  Toolkit.  An  overview  of  the  Toolkit  is 
provided  in  [Clark90aJ,  along  with  references  to  the  other  documents  in  the  set. 
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2  Implementation  Environment 

STEPparse  and  the  STEP  Working  Form  were  developed  on  Sun  St;vru%%  stc'T’v  Su'i 
and  Sun-4‘'’  workstations  panning  sne  L  nix'"  operating  ssMcm  Fhc  parser  :  '  nr. 
plemented  tn  Vacc  and  Lex.  the  Unix  tools  tor  generating  parsers  and  icxicai  atuis  /ers 
The  Working  Form  data  structures  arc  implerncntesl  in  ANSI  Standard  C  t  ^NSls-o 

The  grammar  I'or  the  language  is  processed  b>  Bison,  the  Free  Sottware  Foaiidan..'-; 
implementation  ot  the  Yacc  parser  generator  The  lexical  anaiv/er  is  prtxiuved  h> 

Flex',  a  fast,  public  domain  implementation  of  i-ex  The  C  compiler  used  is  GCC  ah 
a  product  of  the  Free  Softw  are  Foundation,  although  the  Vk  orking  Form  ctxic  cck-  r  .  n 
specifically  depend  on  any  panicular  compiler 

3  Running  STEPparse 

STEPparse  takes  several  optional  command-ime  arguments: 

STEPparse  - d  '..na-cer>. 

'  -  ^  X  o  r  ^  s  s  ^ ' 

>s  <szep>'_ 

The  -d  option  controls  the  debugging  level,  the  argument  can  range  from  0  tthc  dc 
fault)  to  10.  The  E.xprcss  schema  file  is  specified  with  -e.  if  no  -e  option  is  given,  the 
schema  is  read  from  standard  input.  The  STEP  input  file  is  specified  with  - :  .  again, 
standard  input  is  read  if  there  is  no  -s  opoon.  .At  least  one  of  or  *  5  must  be  '•pec 
ifted;  STEf^arsc  cannot  read  both  from  scandaird  input. 

STEPparse  can  be  built  in  two  different  ways,  resulting  tn  different  interaction  patterns 
For  many  applications,  a  single  output  module  is  bound  into  the  translator  at  build  time 
In  this  statically  linked  case,  after  the  STEP  source  file  has  been  parsed  the  user  is  nor 
maiiy  prompted  for  a  single  file  name.  This  is  the  name  of  the  file  to  w  hich 
STEPparsc’s  output  w-ill  be  written.  In  the  other  (dynamically  linked)  version,  no  spe 
cific  output  module  is  loaded  at  build  time.  In  this  case,  after  parsing  us  input,  the  pro¬ 
gram  asks  for  an  output  module.  If  the  file  named  is  an  appropnatc  object  file,  it  is 
loaded  and  an  output  file  name  requested,  which  is  where  the  output  w  ill  be  wntten 
Another  output  module  is  then  requested,  and  this  sequence  continues  until  the  user  en 
ters  an  empty  line  as  the  name  of  the  output  module,  which  signals  STEPparse  to  exit 
This  dynamic  loading  facility  is  only  available  under  BSD4.2  Unix  and  us  denvaics 


1.  The  Free  Software  Foundation  (TSF)  of  Cambridge.  Mas.sachusctLs  is  responsible  for  the  GNT  Project, 
whose  ultimate  goal  is  to  provide  a  free  implementation  of  the  Unix  operaung  system  and  environment 
These  tools  are  not  m  the  Public  Domain:  FSF  retains  ownership  and  copynght  pnvileges.  but  grants  free  dis¬ 
tribution  nghis  under  certain  terms.  At  this  wnung.  further  mformauon  is  available  by  electronic  mail  on  the 
Internet  from  gnu@prep.ai.mit.edu. 

2.  Vem  Paxson’s  Fast  Lex  is  usually  distributed  with  GNX'  software.  It  is,  however,  m  the  Public  Domain, 
and  is  not  an  FSF  product,  "nius.  it  does  not  come  under  the  FSF  licensing  restncuons 
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4  Design  Overview 

The  STEP  Working  Form  (WFi  is  designed  in  an  objcct-oncnicd  fashion,  and  is  inicnd 
ed  to  mesh  cleanly  with  the  N'lST  Express  Working  Fonn  Indeed,  the  F  s  u-Tcniis 
relies  on  the  structures  of  the  Express  Working  Form  as  an  in-menx)r>  data  dictionan 
This  section  discusses  the  design  of  the  Working  Form,  describing  STEPpaise  control 
How  as  well  as  the  data  abstractions  of  the  WF.  More  technical  detail  can  be  found  m 
[Clark90cl. 

4.1  STEPparse  Control  Flow 

.A  STEPparse  translator  consists  of  two  separate  passes:  parsing  and  output  gcneranon 
The  first  pass  builds  an  instantiated  product  model  from  a  STEP  source  nie  This  traxlel 
can  then  be  Q-avcrsed  by  an  output  module  m  the  second  pass,  prcxlucmg  whatever  re 
pon  is  desired. 

.As  currently  implemented,  STEPparse  must,  m  fact,  parse  an  Express  schema  (with 
Fed-X)  before  u  can  interpret  the  constructs  m  a  STEP  physical  file.  To  do  this. 
STEPparse  first  invokes  Fed-X’s  first  two  passes  to  build  a  data  dictionary  ,  and  then 
proceeds  to  parse  its  STEP  source  file. 

4.2  Working  Form  Data  Structures 

The  STEP  Working  Form  consists  of  two  data  abstractions.  The  Instance  abstraction 
represents  individual  entity  instances  in  a  product  model,  as  well  as  aggregates  and  un 
structured  values  (integers,  booleans.  etc.).  A  more  object-onented  design  would  clear¬ 
ly  break  these  down  into  several  separate  subclasses  of  Instance;  implementation 
considerations  have  resulted  in  a  single  module.  The  second  abstraction  represents  a 
complete  product  model.  This  basically  consists  of  an  ordered  collection  of  Instances 
and  an  Express  model  to  give  it  a  context.  The  Working  Form  currently  does  not  record 
header  information  (as  found  in  STEP  physical  files),  although  this  would  ccnainly  be 
useful. 

4.2.1  Instance 

As  mentioned  above,  the  Instance  abstraction  is  really  the  union  of  several  other  con-^ 
ceptuaJ  classes,  representing  entity  instances,  simple-typed  data  (integer  values,  bool¬ 
eans,  etc.),  and  various  kinds  of  aggregates.  Most  of  the  access  functions  for  this 
abstraction  are  restricted  to  act  on  Instances  of  cenain  classes,  which  indicates  very 
clearly  the  need  for  this  module  to  be  broken  down  into  its  component  classes;  this  has 
not  been  done,  primarily  because  of  limitations  of  the  implementation  language.  C. 

Cenain  attributes  are  common  to  all  Instances.  For  example,  each  instance  is  marked 
with  a  Type,  which  determines  the  context(s)  in  which  it  can  be  used.  The  Type  also 
provides  an  interpretation  for  the  Instance's  value.  A  user  data  field  is  provided  so  that 
an  arbitrary  C  pointer  can  be  associated  with  each  Instance  in  an  instantiated  model. 

This  allows  a  Working  Form  model  to  be  linked  to  the  internal  data  structures  of  a  solid 

modeler,  for  example. 


f’agc  ' 
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Addinonally,  every'  Instance  has  a  value  tleid  The  type  of  this  held  vxnes  v*.  ideiy  v,  -.‘n 
the  type  of  the  instance,  but  there  are  three  pnmary  classes:  simple  tunstrut  tured  a, 
ues,  aggregates,  and  entity  instances.  Esampies  of  Instances  ssith  simple  s  alecs  arc 
numbers,  smngs,  and  booleans  These  instances  each  have  a  single,  atomic  value  oi  '.t'.c 
corresponding  C  type  tir.z.  .r.nar* ,  etc.  >  .\n  aggregate  t  which  may  txr  an  arras  r'.ic. 
list,  or  set)  consists  of  a  coilccuon  of  value.,  each  of  which  is  uself  an  Instance  Ti:c*-t 
elements  can  be  accessed  via  indexing,  with  valid  indices  ranging  between  Knacr  .me. 
upper  bounds  specified  by  the  Express  mode!.  Note  that  these  bounds  are  in'er''rciccl 
differently  for  different  classes  of  aggregates  in  Express.  The  bounds  dircctiv  '  pemf. 
the  range  of  allowable  indices  for  an  array,  while  they  limit  the  size  of  other  aggregate 
types.  Thus,  indices  for  list.  bags,  and  sets  range  from  0  to  the  cunent  size  ot  the  ,sg 
gregate,  which  in  turn  must  fall  between  the  upper  and  lower  bssunds  Tfie  Express  ;an 
guage  also  specifics  typc-spccific  operations  for  each  class  of  aggregatev,  such  as 
intersection  and  union  of  sets  and  bags,  and  list  concatenation.  These  operations  .irc 
provided  by  the  STEP  WF  as  the  preferred  mode  of  interaction  with  aggregate  inst.inc 
es.  .An  entity  instance's  value  again  consists  of  a  collection  of  Instances  These  are  ac 
cessed  by  name,  using  the  attnbute  names  from  the  entity's  class 

Finally,  an  Instance  may  have  a  name.  Normally,  only  external  (non-embeddcc !  ent; 
ties  will  be  named;  all  other  Instances  will  have  N’JLL  names  This  is  due  to  the  usage 
prescribed  by  the  STEP  Physical  File  format:  an  embedded  entity  cannot  be  referenced 
outside  of  the  immediate  context  in  which  it  is  defined,  and  so  has  no  need  for  a  name 
An  external  entity,  on  the  other  hand,  can  be  referenced  by  any  other  entity  in  a  prixluct 
model.  This  reference  requires  the  entity's  name  as  a  handle. 

4,2.2  Product 

The  Product  abstraction  ties  things  together  m  the  STEP  Working  Form.  This  mrxlule 
is  used  to  represent  a  STEP  product  model  as  a  whole.  A  Product  consists  of  a  collec¬ 
tion  of  (presumably  interconnected)  Instances  and  an  Express  tur.ccptual  schema  lo 
give  these  Instances  context.  This  schema  serves  as  a  data  dictionary  for  the  Product. 
External  entities  in  a  Product  can  be  looked  up  by  name;  other  Instances  can  only  be 
retneved  by  coming  upion  them  as  components  of  known  Instances. 

Externally,  a  Product  looks  like  a  somewhat  intelligent  conuincr  object.  New  Inst;inces 
can  be  added  to  this  container,  and  existing  Instances  can  be  retneved  from  it  by  name 
Additional  functionality  can  be  gained  from  the  attached  Express  information  model 

5  Missing  Features 

Current'  /,  the  Working  Form  does  not  handle  user-defined  entities.  STEPparse  accepts 
user-defined  entities  in  a  source  file,  and  prints  a  warning  message  indicating  that  they 
cannot  be  represented  in  the  Working  Form. 

As  mentioned  above,  file  header  information  from  PDES/STEP  physical  files  is  not  re¬ 
tained  in  the  Working  Form,  although  STEPparse  silently  accepts  file  headers. 
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Aggregates  with  non-constant  expressions  as  bounds  are  not  handled  properly  Such 
an  aggregate’s  type  information  accurately  reflects  the  true  upper  bound,  but  the  STEP 
WF  routines  treat  the  bound  as  if  a  were  unspecified.  Since  unbounded  aggregates  ,irc 
dynamically  sized,  this  does  not  cause  memory  management  problems;  the  onh  dru  ^ 
back  IS  that  the  Working  Form  docs  not  enforce  sizx  constraints  on  such  aggregates 

Comments  are  currently  discarded  during  lexical  analysis,  and  so  currently  have  no 
chance  to  be  recorded  by  the  parser.  There  has  been  some  interest  in  developing  a 
mechanism  through  which  applications  which  modify  STEP  physical  files  can  preser.  c 
comments  found  in  the  input  file. 

6  Conclusion 

The  combination  of  the  STEP  Working  Form  with  an  Express  Working  Form  data  die 
tionary  provides  a  flexible  mechanism  for  performing  various  manipulations  of  STEP 
data  in  a  schema- independent  manner.  Although  it  remains  to  be  seem  how  useful  this 
schema-independence  will  ^.e  in  higher-level  end-user  applications  (e.g..  design  editors, 
configuration  management  systems,  and  process  planning  systems),  the  present  archi¬ 
tecture  is  quite  useful  for  such  genenc  tasks  as  translation  and  database  loading. 

For  further  information  on  STEPparse.  the  STEP  W'orking  Form,  or  Oioer  components 
of  the  Toolkit,  or  to  obtain  a  copy  of  the  software,  use  the  attached  order  form. 


The  NIST  Working  Form  for  STEP 


Page  5 


Stephen  SiiiAuisO  ClirK 


A  References 

[Altemueller88) 
(A  NS  1891 
[Ciark90a| 

[Clark90b! 

[Clark90c! 

[Clark90dl 

[Nickerson90] 

[Schenck90| 

[Smith88] 


Altemueller.  J..  The  STEP  File  Sirucmre.  !SO  TCI  84, SC4.U  G 1 
Document  N279.  September,  1988 

American  National  Standards  Institute,  ProgTammtm;  Lanciiace  C. 
Document  ANSI  X3. 159- 1989  . . 

Clark.  S.  N’.,  An  Introduciion  to  The  NIST  PDES  Twikn.  NJSTIR 
4336,  National  Institute  of  Standards  and  Technology,  Gaiiher'.burc. 
MD,  May  1990 

Clark,  S.N..  Fed-X:  The  NIST  Express  Translator.  NISTIR  4'"”;, 
National  Institute  of  Standards  and  Technology.  Gaithersburg.  MD 
August  1990 

Clark.  S.N..  NIST  STEP  Working  Form  Programmer's  Retergr.v;e. 
NISTIR  4353.  National  Institute  of  Standards  and  Technology. 
Gaithersburg,  MD,  June  1990 

Clark.  S.N.,  ODES  User’s  Guide.  NISTIR  4361  ,  National  Institute 
of  Standards  and  Technology,  Gaithersburg.  .MD.  June  1990 

Nickerson,  D..  The  NIST  SOL  Database  Loader:  STEP  W'orkir.ii 
Form  to  SOL,  NISTIR  4337,  .National  Institute  of  Standards  and 
Technology.  Gaithersburg.  MD.  .Vlay  1990 

Schenck.  D..  ed..  Exchange  of  Product  Model  Data  •  Pan  i ! :  The 
Express  Language.  ISO  TC184/SC4  Document  .N64.  July  1990 

Smith,  B.,  and  G.  Rinaude*.  cds..  Product  Data  Exchange 
Specification  First  Working  Draft.  NISTIR  88-4004,  National 
Institute  of  Standards  and  Technology,  Gaithersburg,  MD. 
December  1988 


The  NIST  Working  Form  for  STEP 


Paec  6 


Nauonal  Insuiuie  of  Siandards  and  Technology 

Gaithersburg  MD,.  208^9 

Metrology  Building,  Rm-A127 

Attn.  Secretary  Nationai  PDES  Testbed 

(301)975-3508 


Please  send  the  following  documents 
and/or  software: 

j  I  Clark.  S29..  An  Introduction  to  The  NIST  PDES  Toolkit 

I  I  Claric.  S  J9..  The  NIST  PDES  Toolkit:  Tecnnical  Fundamentals 

j  I  Clark.  S  JV..  Fed-X:  The  bflST  Express  Translator 

Q  Clark.  S  The  NIST  Working  Form  for  STEP 

Q  Clark.  S  J»l..  NIST  Express  Working  Form  PrQgramrner's  Reference 

(  I  Clark,  S.N.,  NIST  STEP  Working  Form  Proerammer's  Reference. 

Q]  Clark.  S  J9..  ODES  User’s  Guide 

I  I  Clark.  S.N..  ODES  Administrative  Guide 

I  [  Morris.  K.C..  Translating  Express  to  SOL:  A  User’s  Guide 

I  j  Nickerson.  D.,  The  NIST  SQL  Database  Loader:  STEP  Working  Form  to 
SQL 

j  I  Slrouse,  K.,  McLay,  M.,  The  PDES  Testbed  User  Guide 
OTHER  (PLEASE  SPECIFY) 


NATIONAL  __ 


TESTBED  ~ 


These  documents  and  corresponding  software  will  be 
available  from  NTIS  in  the  future.  ’'Mien  available,  the 
NTIS  ordering  information  will  be  forthcoming. 
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